The TALP-UPC Approach to System Selection: Asiya Features and Pairwise Classification Using Random Forests
نویسندگان
چکیده
This paper describes the TALP-UPC participation in the WMT’13 Shared Task on Quality Estimation (QE). Our participation is reduced to task 1.2 on System Selection. We used a broad set of features (86 for German-to-English and 97 for English-to-Spanish) ranging from standard QE features to features based on pseudo-references and semantic similarity. We approached system selection by means of pairwise ranking decisions. For that, we learned Random Forest classifiers especially tailored for the problem. Evaluation at development time showed considerably good results in a cross-validation experiment, with Kendall’s τ values around 0.30. The results on the test set dropped significantly, raising different discussions to be taken into account.
منابع مشابه
Selection of Favorite Reusable Launch Vehicle Concepts by using the Method of Pairwise Comparison
The contribution of this paper to the space transportation system field is to select promising Reusable Launch Vehicle (RLV) concepts by using a formal evaluation procedure. The vehicle system is divided into design features. Every design feature can have alternative characteristics. All combinations of design features and characteristics are compared pairwise with each other. The innovation an...
متن کاملTALP-UPC at TREC 2005: Experiments Using a Voting Scheme Among Three Heterogeneous QA Systems
This paper describes the experiments of the TALPUPC group for factoid and ’other’ (definitional) questions at TREC 2005 Main Question Answering (QA) task. Our current approach for factoid questions is based on a voting scheme among three QA systems: TALP-QA (our previous QA system), Sibyl (a new QA system developed at DAMA-UPC and TALP-UPC), and Aranea (a web-based data-driven approach). For de...
متن کاملThe TALP-UPC Approach to Tweet-Norm 2013
This paper describes the methodology used by the TALP-UPC team for the SEPLN 2013 shared task of tweet normalization (Tweet-Norm). The system uses a set of modules that propose different corrections for each out-of-vocabulary word. The final correction is chosen by weighted voting according to each module accuracy.
متن کاملClassification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
متن کاملRandom Forests-based Feature Selection for Land-use Classification Using Lidar Data and Orthoimagery
The development of lidar system, especially incorporated with high-resolution camera components, has shown great potential for urban classification. However, how to automatically select the best features for land-use classification is challenging. Random Forests, a newly developed machine learning algorithm, is receiving considerable attention in the field of image classification and pattern re...
متن کامل